
Europäisches Patentamt

European Patent Office                    Publication number: 0 036 776

Office européen des brevets



EUROPEAN PATENT APPLICATION

Application number: 81301227.5          Int. Cl.³: C12N 15/00 // C12P 21/00

Date of filing: 23.03.81

Priority: 24.03.80 US 133236

Date of publication of application: 30.09.81 Bulletin 81/39

Designated Contracting States: AT BE CH DE FR GB IT LI LU NL SE



Applicant: GENENTECH, INC., 460 Point San Bruno Boulevard,
So. San Francisco, California 94080 (US)

Inventor: Kleid, Dennis G., 724 Costa Rica Avenue,
San Mateo, California 94404 (US)
Inventor: Yansura, Daniel G., 125 Pioche, San Francisco,
California 94134 (US)
Inventor: Heyneker, Herbert L., 2621 Easton Drive,
Burlingame, California 94010 (US)
Inventor: Miozzari, Giuseppe F., Im Feuerbusch 9,
Augarten, CH-4310 Rheinfelden (CH)

Representative: Armitage, Ian Michael et al, MEWBURN ELLIS & CO.,
70/72 Chancery Lane, London WC2A 1AD (GB)



A method of producing a polypeptide product and a plasmidic expression
vehicle therefor, a method of creating an expression plasmid, a method
of cleaving double stranded DNA, and specific plasmids.

Novel plasmidic expression vehicles and methods of using them in the
production of useful polypeptides by recombinant bacteria are
described. The plasmids employ a tryptophan promoter-operator system
from which the attenuator region ordinarily present has been deleted.
Bacteria containing the plasmids can accordingly be repressed by the
addition of tryptophan against expression of desired polypeptides coded
for by inserted genes while they are grown to levels suitable for
industrial-scale production. Additive tryptophan may then be withdrawn,
essentially derepressing the pathway and permitting efficient
production of the desired product in high yield.

A METHOD OF PRODUCING A POLYPEPTIDE PRODUCT
AND A PLASMIDIC EXPRESSION VEHICLE THEREFOR,
A METHOD OF CREATING AN EXPRESSION PLASMID,
A METHOD OF CLEAVING DOUBLE STRANDED DNA,
AND SPECIFIC PLASMIDS.



BACKGROUND OF THE INVENTION 

With the advent of recombinant DNA technology, the controlled
bacterial production of an enormous variety of useful polypeptides has
become possible. Already in hand are bacteria modified by this
technology to permit the production of polypeptide products such as
somatostatin (K. Itakura, et al., Science 198, 1056 [1977]), the
(component) A and B chains of human insulin (D.V. Goeddel, et al., Proc
Nat'l Acad Sci, USA 75, 106 [1979]), and human growth hormone (D.V.
Goeddel, et al., Nature 281, 544 [1979]). More recently, recombinant
DNA techniques have been used to occasion the bacterial production of
thymosin alpha 1, an immune potentiating substance produced by the
thymus. Such is the power of the technology that virtually any useful
polypeptide can be bacterially produced, putting within reach the
controlled manufacture of hormones, enzymes, antibodies, and vaccines
against a wide variety of diseases. The cited materials, which describe
in greater detail the representative examples referred to above, are
incorporated herein by reference, as are other publications referred to
infra, to illuminate the background of the invention.

The work horse of recombinant DNA technology is the plasmid, a
non-chromosomal loop of double-stranded DNA found in bacteria,
oftentimes in multiple copies per bacterial cell. Included in the
information encoded in the plasmid DNA is that required to reproduce the
plasmid in daughter cells (i.e., a "replicon") and ordinarily, one or
more selection characteristics, such as resistance to antibiotics, which
permit clones of the host cell containing the plasmid of interest to be
recognized and preferentially grown in selective media. The utility of
bacterial plasmids lies in the fact that they can be specifically
cleaved by one or another restriction endonuclease or "restriction
enzyme", each of which recognizes a different site on the plasmidic
DNA. Thereafter heterologous genes or gene fragments may be inserted
into the plasmid by endwise joining at the cleavage site or at
reconstructed ends adjacent the cleavage site. As used herein, the term
"heterologous" refers to a gene not ordinarily found in, or a
polypeptide sequence ordinarily not produced by, E. coli, whereas the
term "homologous" refers to a gene or polypeptide which is produced in
wild-type E. coli. DNA recombination is performed outside the bacteria,
but the resulting "recombinant" plasmid can be introduced into bacteria
by a process known as transformation and large quantities of the
heterologous gene-containing recombinant plasmid obtained by growing the
transformant. Moreover, where the gene is properly inserted with
reference to portions of the plasmid which govern the transcription and
translation of the encoded DNA message, the resulting expression vehicle
can be used to actually produce the polypeptide sequence for which the
inserted gene codes, a process referred to as expression.

Expression is initiated in a region known as the promoter which is
recognized by and bound by RNA polymerase. In some cases, as in the trp
operon discussed infra, promoter regions are overlapped by "operator"
regions to form a combined promoter-operator. Operators are DNA
sequences which are recognized by so-called repressor proteins which
serve to regulate the frequency of transcription initiation at a
particular promoter. The polymerase travels along the DNA, transcribing
the information contained in the coding strand from its 5' to 3' end
into messenger RNA which is in turn translated into a polypeptide having
the amino acid sequence for which the DNA codes. Each amino acid is
encoded by a unique nucleotide triplet or "codon" within what may for
present purposes be referred to as the "structural gene", i.e. that part
which encodes the amino acid sequence of the expressed product. After
binding to the promoter, the RNA polymerase first transcribes
nucleotides encoding a ribosome binding site, then a translation
initiation or "start" signal (ordinarily ATG, which in the resulting
messenger RNA becomes AUG), then the nucleotide codons within the
structural gene itself. So-called stop codons are transcribed at the
end of the structural gene whereafter the polymerase may form an
additional sequence of messenger RNA which, because of the presence of
the stop signal, will remain untranslated by the ribosomes. Ribosomes
bind to the binding site provided on the messenger RNA, in bacteria
ordinarily as the mRNA is being formed, and themselves produce the
encoded polypeptide, beginning at the translation start signal and
ending at the previously mentioned stop signal. The desired product is
produced if the sequences encoding the ribosome binding site are
positioned properly with respect to the AUG initiator codon and if all
remaining codons follow the initiator codon in phase. The resulting
product may be obtained by lysing the host cell and recovering the
product by appropriate purification from other bacterial protein.
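
The path from coding strand to polypeptide described above can be sketched in a few lines of Python. This is an illustrative toy rather than anything taken from the specification: the function names, the deliberately partial codon table and the example sequence are all assumptions made for the sketch.

```python
# Toy model of the expression steps described above. The codon table is
# deliberately partial: it covers only the codons in the example sequence.
CODON_TABLE = {"AUG": "Met", "GCU": "Ala", "GGU": "Gly", "UGU": "Cys", "AAA": "Lys"}
STOP_CODONS = {"UAA", "UAG", "UGA"}


def transcribe(coding_strand):
    """Return mRNA: same 5'->3' sequence as the coding strand, U in place of T."""
    return coding_strand.upper().replace("T", "U")


def translate(mrna):
    """Translate in phase from the first AUG up to the first stop codon."""
    start = mrna.find("AUG")
    if start == -1:
        return []
    peptide = []
    for i in range(start, len(mrna) - 2, 3):
        codon = mrna[i:i + 3]
        if codon in STOP_CODONS:
            break
        peptide.append(CODON_TABLE[codon])
    return peptide


# Hypothetical coding strand: spacer, ATG start, four codons, TAG stop, trailer.
example = "AGGAGGTTTATGGCTGGTTGTAAATAGGGC"
print(translate(transcribe(example)))
```

Shifting the codons after the ATG by one or two nucleotides changes every triplet that is read, which is why the remaining codons must follow the initiator codon in phase.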

Polypeptides expressed through the use of recombinant DNA
technology may be entirely heterologous, as in the case of the direct
expression of human growth hormone, or alternatively may comprise a
heterologous polypeptide and, fused thereto, at least a portion of the
amino acid sequence of a homologous peptide, as in the case of the
production of intermediates for somatostatin and the components of human
insulin. In the latter cases, for example, the fused homologous
polypeptide comprised a portion of the amino acid sequence for beta
galactosidase. In those cases, the intended bioactive product is
bioinactivated by the fused, homologous polypeptide until the latter is
cleaved away in an extracellular environment. Fusion proteins like
those just mentioned can be designed so as to permit highly specific
cleavage of the precursor protein from the intended product, as by the
action of cyanogen bromide on methionine, or alternatively by enzymatic
cleavage. See, eg., G.B. Patent Publication No. 2 007 576 A.

If recombinant DNA technology is to fully sustain its promise,
systems must be devised which optimize expression of gene inserts, so
that the intended polypeptide products can be made available in high
yield. The beta lactamase and lactose promoter-operator systems most
commonly used in the past, while useful, have not fully utilized the
capacity of the technology from the standpoint of yield. A need has
existed for a bacterial expression vehicle capable of the controlled
expression of desired polypeptide products in higher yield.

Tryptophan is an amino acid produced by bacteria for use as a
component part of homologous polypeptides in a biosynthetic pathway
which proceeds: chorismic acid → anthranilic acid → phosphoribosyl
anthranilic acid → CDRP [enol-1-(o-carboxyphenylamino)-1-desoxy-D-
ribulose-5-phosphate] → indol-3-glycerol-phosphate, and ultimately to
tryptophan itself. The enzymatic reactions of this pathway are
catalyzed by the products of the tryptophan or "trp" operon, a
polycistronic DNA segment which is transcribed under the direction of
the trp promoter-operator system. The resulting polycistronic messenger
RNA encodes the so-called trp leader sequence and then, in order, the
polypeptides referred to as trp E, trp D, trp C, trp B and trp A. These
polypeptides variously catalyze and control individual steps in the
pathway chorismic acid → tryptophan.

In wild-type E. coli, the tryptophan operon is under at least three
distinct forms of control. In the case of promoter-operator repression,
tryptophan acts as a corepressor and binds to its aporepressor to form
an active repressor complex which, in turn, binds to the operator,
closing down the pathway in its entirety. Secondly, by a process of
feedback inhibition, tryptophan binds to a complex of the trp E and trp
D polypeptides, prohibiting their participation in the pathway
synthesis. Finally, control is effected by a process known as
attenuation under the control of the "attenuator region" of the gene, a
region within the trp leader sequence. See generally G.F. Miozzari
et al., J. Bacteriology 133, 1457 (1978); The Operon 263-302, Cold Spring
Harbor Laboratory (1978), Miller and Reznikoff, eds.; F. Lee et al.,
Proc. Natl. Acad. Sci. USA 74, 4365 (1977) and K. Bertrand et al., J.
Mol. Biol. 103, 319 (1975). The extent of attenuation appears to be
governed by the intracellular concentration of tryptophan, and in
wild-type E. coli the attenuator terminates expression in approximately
nine out of ten cases, possibly through the formation of a secondary
structure, or "termination loop", in the messenger RNA which causes the
RNA polymerase to prematurely disengage from the associated DNA.
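
As a rough worked figure, the "nine out of ten" estimate above implies that only about one transcript in ten reads through the attenuator. The sketch below does that arithmetic. It assumes, purely for illustration, that every other step is unchanged, so the resulting fold figure is an upper bound on what removing the attenuator could contribute and not a measured yield. The function names are hypothetical.

```python
def read_through_fraction(terminated, total):
    """Fraction of initiated transcripts that pass the attenuator."""
    return (total - terminated) / total


def max_fold_gain(terminated, total):
    """Upper bound on the gain in full-length transcripts if the
    attenuator were absent (assumes nothing else is limiting)."""
    return total / (total - terminated)


print(read_through_fraction(9, 10))  # about one transcript in ten
print(max_fold_gain(9, 10))          # at most about tenfold
```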



Other workers have employed the trp operon to obtain some measure
of heterologous polypeptide expression. This work, it is believed,
attempted to deal with problems of repression and attenuation by the
addition of indole acrylic acid, an inducer and analog which competes
with tryptophan for trp repressor molecules, tending toward derepression
by competitive inhibition. At the same time the inducer diminishes
attenuation by inhibiting the enzymatic conversion of indole to
tryptophan and thus effectively depriving the cell of tryptophan. As a
result more polymerases successfully read through the attenuator.
However, this approach appears problematic from the standpoint of
completing translation consistently and in high yield, since
tryptophan-containing protein sequences are prematurely terminated in
synthesis due to lack of utilizable tryptophan. Indeed, an effective
relief of attenuation by this approach is entirely dependent on severe
tryptophan starvation.

The present invention addresses problems associated with tryptophan
repression and attenuation in a different manner and provides (1) a
method for obtaining an expression vehicle designed for direct
expression of heterologous genes from the trp promoter-operator, (2)
methods for obtaining vehicles designed for expression, from the
tryptophan operator-promoter, of specifically cleavable polypeptides

SUMMARY OF THE INVENTION



According to the present invention, novel plasmidic expression
vehicles are provided for the production in bacteria of heterologous
polypeptide products, the vehicles having a sequence of double-stranded
DNA comprising, in phase from a first 5' to a second 3' end of the
coding strand, a trp promoter-operator, nucleotides coding for the trp
leader ribosome binding site, and nucleotides encoding translation
initiation for expression of a structural gene that encodes the amino
acid sequence of the heterologous polypeptide. The DNA sequence referred
to contains neither a trp attenuator region nor nucleotides coding for
the trp E ribosome binding site. Instead, the trp leader ribosome
binding site is efficiently used to effect expression of the information
encoded by an inserted gene.

Cells are transformed by addition of the trp
promoter-operator-containing and attenuator-lacking plasmids of the
invention and grown up in the presence of additive tryptophan. The use
of tryptophan-rich media provides sufficient tryptophan to essentially
completely repress the trp promoter-operator through trp/repressor
interactions, so that cell growth can proceed uninhibited by premature
expression of large quantities of heterologous polypeptide encoded by an
insert otherwise under the control of the trp promoter-operator system.
When the recombinant culture has been grown to the levels appropriate
for industrial production of the polypeptide, on the other hand, the
external source of tryptophan is removed, leaving the cell to rely only
on the tryptophan that it can itself produce. The result is mild
tryptophan limitation and, accordingly, the pathway is derepressed and
highly efficient expression of the heterologous insert occurs,
unhampered by attenuation because the attenuator region has been deleted
from the system. In this manner the cells are never severely deprived
of tryptophan and all proteins, whether they contain tryptophan or not,
can be produced in substantial yields.

Further, the invention provides means of cleaving double-stranded
DNA at any desired point, even absent a restriction enzyme site, a
technique useful in, among other things, the creation of trp operons
having attenuator deletions other than those previously obtained by
selection of mutants.

Finally, the invention provides a variety of useful intermediates
and endproducts, including specifically cleavable
heterologous-homologous fusion proteins that are stabilized against
degradation under expression conditions.

The manner in which these and other objects and advantages of the
invention are obtained will become more apparent from the detailed
description which follows and from the accompanying drawings in which:

Figures 1 and 2 illustrate a preferred scheme for forming plasmids
capable of expressing heterologous genes as fusions with a
portion of the trp D polypeptide, from which fusion they may
be later cleaved;

Figure 3 is the result of polyacrylamide gel segregation of cell
protein containing homologous (trp D') - heterologous
(somatostatin or thymosin α1) fusion proteins;

Figures 4, 5 and 6 illustrate successive stages in a preferred
scheme for the creation of a plasmid capable of directly
expressing a heterologous gene (human growth hormone) under
the control of the trp promoter-operator system;

Figure 7 is the result of polyacrylamide gel segregation of cell
protein containing human growth hormone directly expressed
under the control of the trp promoter-operator system;

Figures 8, 9 (a-b) and 10 illustrate in successive stages a
preferred scheme for the creation of plasmids capable of
expressing heterologous genes (in the illustrated case, for
somatostatin) as fusions with a portion of the trp E
polypeptide, from which fusions they may be later cleaved;

Figure 11 is the result of polyacrylamide gel segregation of cell
protein containing homologous (trp E) - heterologous fusion proteins
for the production of, respectively, somatostatin,
thymosin alpha 1, human proinsulin, and the A and B chains of
human insulin.

Figures 12 and 13 illustrate in successive stages the manner in
which the plasmid created by the scheme of Figures 8-10
inclusive is manipulated to form a system in which other
heterologous genes may be interchangeably expressed as fusions
with trp E polypeptide sequences.



In the Figures, only the coding strand of the double-stranded
plasmid and linear DNAs are depicted in most instances, for clarity in
illustration. Antibiotic resistance-encoding genes are denoted apR
(ampicillin) and tcR (tetracycline). The legend tcS connotes a gene
for tetracycline resistance that is not under the control of a
promoter-operator system, such that plasmids containing the gene will
nevertheless be tetracycline sensitive. The legend "apS" connotes
ampicillin sensitivity resulting from deletion of a portion of the gene
encoding ampicillin resistance. Plasmidic promoters and operators are
denoted "p" and "o". The letters A, T, G and C respectively connote the
nucleotides containing the bases adenine, thymine, guanine and
cytosine. Other Figure legends appear from the text.

The preferred embodiments of the invention described below involved
use of a number of commonly available restriction endonucleases next
identified, with their corresponding recognition sequences and
(indicated by arrow) cleavage patterns.



XbaI:     T↓CTAGA          TaqI:     T↓CGA
          AGATC↑T                    AGC↑T

EcoRI:    G↓AATTC          HindIII:  A↓AGCTT
          CTTAA↑G                    TTCGA↑A

BglII:    A↓GATCT          HpaI:     GTT↓AAC
          TCTAG↑A                    CAA↑TTG

PvuII:    CAG↓CTG          PstI:     CTGCA↓G
          GTC↑GAC                    G↑ACGTC

BamHI:    G↓GATCC
          CCTAG↑G

Where the points of cleavage are spaced apart on the respective strands
the cleaved ends will be "sticky", ie, capable of reannealing or of
annealing to other complementarily "sticky"-ended DNA by Watson-Crick
base pairing (A to T and G to C) in mortise and tenon fashion. Some
restriction enzymes, such as HpaI and PvuII above, cleave to leave
"blunt" ends. The nucleotide sequences above are represented in
accordance with the convention used throughout: upper strand is the
protein encoding strand, and in proceeding from left to right on that
strand one moves from the 5' to the 3' end thereof, ie, in the direction
of transcription from a "proximal" toward a "distal" point.
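
The distinction between "sticky" and "blunt" ends can be illustrated with a short Python sketch. The recognition sequences and cut positions below are the standard published ones for these nine enzymes, written for the upper strand in the 5' to 3' direction. The data layout and function names are assumptions made for the illustration.

```python
# Upper-strand recognition sequence (5'->3') and the offset of the cut
# within it. All nine sites are palindromic, so the lower-strand cut lies
# at len(site) - offset when measured along the upper strand.
ENZYMES = {
    "XbaI": ("TCTAGA", 1),
    "TaqI": ("TCGA", 1),
    "EcoRI": ("GAATTC", 1),
    "HindIII": ("AAGCTT", 1),
    "BglII": ("AGATCT", 1),
    "HpaI": ("GTTAAC", 3),
    "PvuII": ("CAGCTG", 3),
    "PstI": ("CTGCAG", 5),
    "BamHI": ("GGATCC", 1),
}


def end_type(enzyme):
    """'blunt' if both strands are cut at the same point, else 'sticky'."""
    site, cut = ENZYMES[enzyme]
    return "blunt" if 2 * cut == len(site) else "sticky"


def overhang(enzyme):
    """Single-stranded stretch left on a cleaved end (empty if blunt)."""
    site, cut = ENZYMES[enzyme]
    low, high = sorted((cut, len(site) - cut))
    return site[low:high]


def find_sites(sequence, enzyme):
    """0-based start positions of the recognition site on the upper strand."""
    site, _ = ENZYMES[enzyme]
    sequence = sequence.upper()
    positions = []
    i = sequence.find(site)
    while i != -1:
        positions.append(i)
        i = sequence.find(site, i + 1)
    return positions


for name in ENZYMES:
    print(name, end_type(name), overhang(name))
```

Two ends can anneal in "mortise and tenon" fashion only when their overhangs are complementary, which is why fragments cut with the same sticky-end enzyme rejoin readily.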

Finally with regard to conventions, the symbol "Δ" connotes a
deletion. Thus, for example, reference to a plasmid followed by, say,
"ΔEcoRI-XbaI" describes the plasmid from which the nucleotide sequence
between EcoRI and XbaI restriction enzyme sites has been removed by
digestion with those enzymes. For convenience, certain deletions are
denoted by number. Thus, beginning from the first base pair ("bp") of
the EcoRI recognition site which precedes the gene for tetracycline
resistance in the parental plasmid pBR322, "Δ1" connotes deletion of
bp 1-30 (ie, ΔEcoRI-HindIII) and consequent disenabling of the
tetracycline promoter-operator system; "Δ2" connotes deletion of bp 1-375
(ie, ΔEcoRI-BamHI) and consequent removal of both the tetracycline
promoter-operator and the structural gene which encodes tetracycline
resistance; and "Δ3" connotes deletion of bp 3611-4359 (ie, ΔPstI-EcoRI)
and elimination of ampicillin resistance. "Δ4" is used to connote
removal of bp ~900 - ~1500 from the trp operon fragment 5 (Fig. 1),
eliminating the structural gene for the trp D polypeptide.
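
The numbered deletions can be treated as 1-indexed, inclusive base-pair ranges counted from the first base pair of the EcoRI site. The following sketch, which uses a placeholder string in place of the real pBR322 sequence and hypothetical names throughout, shows how the ranges translate into deletion sizes.

```python
# Numbered pBR322 deletions as 1-indexed, inclusive base-pair ranges.
DELETIONS = {
    "delta1": (1, 30),       # EcoRI-HindIII
    "delta2": (1, 375),      # EcoRI-BamHI
    "delta3": (3611, 4359),  # PstI-EcoRI
}


def deletion_length(name):
    """Number of base pairs removed by the named deletion."""
    first, last = DELETIONS[name]
    return last - first + 1


def apply_deletion(sequence, name):
    """Return the sequence with the named range removed."""
    first, last = DELETIONS[name]
    if last > len(sequence):
        raise ValueError("sequence is shorter than the deletion endpoint")
    return sequence[:first - 1] + sequence[last:]


# Placeholder only: a run of N standing in for the plasmid sequence.
placeholder = "N" * 4359
for name in DELETIONS:
    print(name, deletion_length(name), len(apply_deletion(placeholder, name)))
```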



DETAILED DESCRIPTION 



The trp leader sequence is made up of base pairs ("bp") 1-162,
starting from the start point for trp mRNA. A fourteen amino acid
putative trp leader polypeptide is encoded by bp 27-71 following the ATG
nucleotides which encode the translation start signal. The trp
attenuator region comprises successive GC-rich and AT-rich sequences
lying between bp 114 and 156 and attenuation is apparently effected on
mRNA nucleotides encoded by bp ~134-141 of the leader sequence. To
express a heterologous polypeptide under the direction of the trp leader
ribosome binding site and at the same time avert attenuation, the
following criteria must be observed:

1. Base pairs 134-141 or beyond must be deleted;
2. The ATG codon of the inserted gene must be positioned in
   correct relation to a ribosome binding site, as is known (see,
   eg., J.A. Steitz "Genetic signals and nucleotide sequences in
   messenger RNA" in Biological Regulation and Control (ed. R.
   Goldberger) Plenum Press, N.Y. (1978)).
3. Where a homologous-heterologous fusion protein is to be
   produced, the translation start signal of a homologous
   polypeptide sequence should remain available, and the codons
   for the homologous portion of the fusion protein have to be
   inserted in phase without intervening translation stop signals.
For example, deleting all base pairs within the leader sequence
distal from bp 70 removes the attenuator region, leaves the ATG
sequence which encodes the translation start signal, and eliminates the
intervening translation stop encoded by TGA (bp 69-71), by eliminating
A and following nucleotides. Such a deletion would result in expression
of a fusion protein beginning with the leader polypeptide, ending with
that encoded by any heterologous insert, and including a distal region
of one of the post-leader trp operon polypeptides determined by the
extent of the deletion in the 3' direction. Thus a deletion extending
into the E gene would lead to expression of a homologous precursor
comprising the L sequence and the distal region of E (beyond the
deletion endpoint) fused to the sequence encoded by any following
insert, and so on.
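
The third criterion (an available start signal, codons in phase, and no intervening stop) lends itself to a mechanical check. The sketch below is a minimal illustration under stated assumptions: the sequences are short made-up examples, not the actual trp leader or any real insert, and the function names are hypothetical.

```python
STOPS = {"TAA", "TAG", "TGA"}


def codons(seq):
    """Split a coding-strand sequence into complete triplets."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def first_stop_index(seq):
    """Index (in codons) of the first in-phase stop codon, or None."""
    for n, codon in enumerate(codons(seq)):
        if codon in STOPS:
            return n
    return None


def check_fusion(homologous, heterologous):
    """List the ways a proposed fusion violates the criteria above."""
    problems = []
    if not homologous.startswith("ATG"):
        problems.append("homologous portion lacks an ATG start")
    if len(homologous) % 3 != 0:
        problems.append("junction is out of phase")
    if first_stop_index(homologous) is not None:
        problems.append("stop codon before the junction")
    stop = first_stop_index(heterologous)
    if stop is not None and stop < len(codons(heterologous)) - 1:
        problems.append("premature stop codon in the insert")
    return problems


# Made-up sequences: six in-phase codons, then an insert ending in TAA.
print(check_fusion("ATGAAAGCAATTTTCGTA", "GCTGGTTGTTAA"))
```

An empty list means the proposed junction passes these three checks. It says nothing about ribosome binding site spacing, which the second criterion covers separately.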

Two particularly useful plasmids from which the attenuator region
has been deleted are the plasmids pGM1 and pGM3, G.F. Miozzari et al.,
J. Bacteriology 133, 1457 (1978). These respectively carry the
deletions trp ΔLE 1413 and trp ΔLE 1417 and express (under the control
of the trp promoter-operator) a polypeptide comprising approximately the
first six amino acids of the trp leader and distal regions of the E
polypeptide. In the most preferred case, pGM1, only about the last third
of the E polypeptide is expressed whereas pGM3 expresses almost
the distal one half of the E polypeptide codons. E. coli K-12 strain
W3110 tna 2⁻ trp⁻ Δ102 containing pGM1 has been deposited with the
American Type Culture Collection (ATCC no. 31622). pGM1 may be
conventionally removed from the strain for use in the procedures
described below.

Alternatively, deletions may be effected by means provided by the
invention for specifically cleaving double-stranded DNA at any desired
site. One example of this cleavage technique appears from Part IV of
the experimental section, infra. Thus, double-stranded DNA is converted
to single-stranded DNA in the region surrounding the intended cleavage
point, as by reaction with lambda exonuclease. A synthetic or other
single-stranded DNA primer is then hybridized to the single-stranded
length earlier formed, by Watson-Crick base-pairing, the primer sequence
being such as to ensure that the 5' end thereof will be coterminous with
the nucleotide on the first strand just prior to the intended cleavage
point. The primer is next extended in the 3' direction by reaction with
DNA polymerase, recreating that portion of the original double-stranded
DNA prior to the intended cleavage that was lost in the first step.
Simultaneously or thereafter, the portion of the first strand beyond the
intended cleavage point is digested away. To summarize, where "v" marks
the intended cleavage point:



a)  intended cleavage point "v"
b)  made single stranded around "v"
c)  primer hybridization
d)  extension from primer
e)  digestion of the first strand beyond "v"

In the most preferred embodiment, steps (d) and (e) are performed
simultaneously, using a polymerase that simultaneously digests the
protruding single stranded end in the 3' → 5' direction and extends the
primer (in the presence of dATP, dGTP, dTTP and dCTP) in the 5' → 3'
direction. The material preferred for this purpose is Klenow Polymerase
I, ie, that fragment obtained by proteolytic cleavage of DNA Polymerase
I which contains the 5' → 3' polymerizing activity and the 3' → 5'
exonucleolytic activity of the parental enzyme, yet lacks its 5' → 3'
exonucleolytic activity. A. Kornberg, DNA Synthesis, 98, W.H. Freeman
and Co., SFO (1974).
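
The primer-directed cleavage steps can be mimicked on plain strings to show what the end product looks like. This is a loose abstraction and not a model of the enzymology: it assumes the "first strand" is already single-stranded around the cleavage point, represents it 5' to 3' as a Python string, and uses hypothetical function names and a made-up sequence.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primer(first_strand, v, length=12):
    """Primer whose 5' end pairs with the base just before cut point v."""
    return reverse_complement(first_strand[max(0, v - length):v])


def primer_directed_cleavage(first_strand, v, primer):
    """Return (trimmed first strand, extended second strand), both 5'->3'."""
    if v < len(primer) or v > len(first_strand):
        raise ValueError("cut point and primer length are inconsistent")
    # Hybridization: the primer must pair with the bases ending at v.
    if first_strand[v - len(primer):v] != reverse_complement(primer):
        raise ValueError("primer does not anneal immediately before v")
    # Extension: the primer grows 5'->3' along the template toward the
    # template's 5' end, recreating the duplex up to v.
    second_strand = reverse_complement(first_strand[:v])
    # Digestion: the protruding 3' tail of the first strand beyond v is
    # removed, leaving a blunt end exactly at v.
    return first_strand[:v], second_strand


first = "GGATCCAAGCTTACGTACGTTTGACA"  # made-up sequence
primer = design_primer(first, 12, length=6)
print(primer_directed_cleavage(first, 12, primer))
```

The two returned strands are exact reverse complements of one another, which is the string-level equivalent of a blunt-ended duplex terminating at the chosen point.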

Using the procedure just described, attenuator deletions may be
made in any desired manner in a trp operon-containing plasmid first
linearized by, eg, cleavage at a restriction site downstream from the
point at which the molecule is to be blunt-ended ("v" above).
Recircularization following deletion of the attenuator region may be
effected, eg, by blunt end ligation or other manners which will be
apparent to the art-skilled.

Although the invention encompasses direct expression of
heterologous polypeptide under the direction of the trp
promoter-operator, the preferred case involves expression of fused
proteins containing both homologous and heterologous sequences, the
latter preferably being specifically cleavable from the former in
extra-cellular environs. Particularly preferred are fusions in which
the homologous portion comprises one or more amino acids of the trp
leader polypeptide and about one-third or more of the trp E amino acid
sequence (distal end). Fusion proteins so obtained appear remarkably
stabilized against degradation under expression conditions.

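Bacteria E. coli K-12 strain W3110 tna 2⁻ trp⁻ Δ102 (pGM1), ATCC
No. 31622, may be used to amplify stocks of the pGM1 plasmid preferably
employed in constructing the attenuator-deficient trp promoter-operator
systems of the invention. This strain is phenotypically trp+ in the
presence of anthranilate and can be grown in minimal media such as LB
supplemented with 50 µg/ml anthranilate.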
All bacterial strains used in trp promoter-operator directed
expression according to the invention are trp repressor⁺ ("trp R⁺")
as in the case of wild-type E. coli, so as to ensure repression until
heterologous expression is intended.

DNA recombination is, in the preferred embodiment, performed in
E. coli K-12 strain 294 (end A, thi⁻, hsr⁻, hsmk⁺), ATCC No.
31446, a bacterial strain whose membrane characteristics facilitate
transformations. Heterologous polypeptide-producing plasmids grown in
strain 294 are conventionally extracted and maintained in solution (eg,
10 mM tris, 1 mM EDTA, pH 8) at from about -20°C to about 4°C. For
expression under industrial conditions, on the other hand, we prefer a
more hardy strain, ie, E. coli K-12 λ⁻ F⁻ RV 308 strʳ, gal 308⁻,
ATCC No. 31608. RV 308 is nutritionally wild-type and grows well in
minimal media, synthesizing all necessary macromolecules from
conventional mixes of ammonium, phosphate and magnesium salts, trace
metals and glucose. After transformation of RV 308 culture with strain
294-derived plasmid the culture is plated on media selective for a
marker (such as antibiotic resistance) carried by the plasmid, and a
transformant colony picked and grown in flask culture. Aliquots of the
latter in 10% DMSO or glycerol solution (in sterile Wheaton vials) are
shell frozen in an ethanol-dry ice bath and frozen at -80°C. To produce
the encoded heterologous polypeptide the culture samples are grown up in
media containing tryptophan so as to repress the trp promoter-operator
and the system then deprived of additive tryptophan to occasion
expression.

For the first stage of growth one may employ, for example, LB
medium (J.H. Miller, Experiments in Molecular Genetics, 433, Cold Spring
Harbor Laboratory 1972) which contains, per liter aqueous solution, 10 g
Bacto tryptone, 5 g Bacto yeast extract and 10 g NaCl. Preferably, the
inoculant is grown to optical density ("o.d.") of 10 or more (at 550
nm), more preferably to o.d. 20 or more, and most preferably to o.d. 30
or more.
For derepression and expression the inoculant is next grown under
conditions which deprive the cell of additive tryptophan. One
appropriate medium for such growth is M9 (J.H. Miller, supra at 431)
prepared as follows (per liter):

    KH2PO4                       3 g
    Na2HPO4                      6 g
    NaCl                         0.5 g
    NH4Cl                        1 g

Autoclave, then add:

    10 ml 0.01 M CaCl2
    1 ml 1 M MgSO4
    10 ml 20% glucose
    Vitamin B1                   1 µg/ml
    Humko hycase amino
    or DIFCO cas. amino acids    40 µg/ml

The amino acid supplement is a tryptophan-lacking acid hydrolysate of
casein.
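
For batches other than one liter, the M9 recipe scales linearly. The sketch below is a convenience calculation only. It assumes the glucose stock is 20% (w/v), ignores the small volume added by the stock solutions when computing final concentrations, and uses hypothetical names throughout.

```python
# Salts weighed out before autoclaving, in grams per liter.
SALTS_G_PER_LITER = {"KH2PO4": 3.0, "Na2HPO4": 6.0, "NaCl": 0.5, "NH4Cl": 1.0}

# Additions after autoclaving: (stock concentration, ml of stock per liter).
# CaCl2 and MgSO4 stocks are in mol/l; the glucose stock is in percent.
ADDITIONS = {
    "CaCl2": (0.01, 10.0),
    "MgSO4": (1.0, 1.0),
    "glucose": (20.0, 10.0),
}


def scale_salts(liters):
    """Grams of each salt for the requested volume."""
    return {name: grams * liters for name, grams in SALTS_G_PER_LITER.items()}


def addition_volumes_ml(liters):
    """Milliliters of each stock solution for the requested volume."""
    return {name: ml * liters for name, (_, ml) in ADDITIONS.items()}


def final_concentration(name):
    """Approximate final concentration, in the units of the stock."""
    stock, ml_per_liter = ADDITIONS[name]
    return stock * ml_per_liter / 1000.0


print(scale_salts(5))
print(addition_volumes_ml(5))
print({name: final_concentration(name) for name in ADDITIONS})
```

On these assumptions the medium ends up at roughly 0.1 mM CaCl2, 1 mM MgSO4 and 0.2% glucose.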

To commence expression of the heterologous polypeptide the
inoculant grown in tryptophan-rich media may, eg, be diluted into a
larger volume of medium containing no additive tryptophan (for example,
2-10 fold dilution), grown up to any desired level (preferably short of
stationary growth phase) and the intended product conventionally
obtained by lysis, centrifugation and purification. In the
tryptophan-deprived growth stage, the cells are preferably grown to od
in excess of 10, more preferably in excess of od 20 and most preferably
to or beyond od 30 (all at 550 nm) before product recovery.
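
A back-of-envelope calculation shows how much growth the derepressed stage involves after dilution. The sketch assumes optical density scales linearly with cell density and that the culture grows by simple doubling, neither of which is stated in the text. The function names are hypothetical.

```python
import math


def od_after_dilution(od, fold):
    """Optical density immediately after an n-fold dilution."""
    if fold <= 0:
        raise ValueError("dilution factor must be positive")
    return od / fold


def doublings_needed(start_od, target_od):
    """Number of doublings to grow from start_od to target_od."""
    return math.log2(target_od / start_od)


start = od_after_dilution(10.0, 10.0)        # od 10 inoculant, 10-fold dilution
print(start)
print(round(doublings_needed(start, 30.0), 2))
```

The same dilution factor also applies to any tryptophan carried over from the first-stage medium, which is one reason a larger dilution withdraws the additive tryptophan more completely.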






All DNA recombination experiments described in the Experimental 

I. Expression of D-polypeptide fusion protein

A preferred method of expressing fusion proteins comprising desired
polypeptides and, fused thereto, a portion of the amino acid sequence of
the trp D polypeptide that is separable in vitro by virtue of a
methionine amino acid specifically sensitive to CNBr cleavage, is
described with reference to Figures 1-3.

A. Construction of pBRHtrp 

Plasmid pGMl Fig. 1) carries the E. coli tryptophan operon 
containing the deletion ALE1413 (G.F. Miozzari, et^ al- , (1978) 

10 Bacteriology 1457-1466)) and hence expresses a fusion protein compr ising 
the first 6 amino acids of the trp leader and approximately the last 
third of the trp E polypeptide (hereinafter referred to in conjunction 
as LE*), as well as the trp D polypeptide in its entirety, all under the 
control of the trp promoter-operator system. The plasmid, 20 yg, was 

15 digested with the restriction enzyme PvuII which cleaves the plasmid at 
five sites. The gene fragments 2 were next combined with EcoRI linkers 
(consisting of a self complementary oligonucleotide _3 of the sequence: 
pCATGAATTCATG) providing an EcoRI cleavage site for a later cloning into 
a plasmid containing an EcoRI site (20), The 20 pg of DNA fragments 2^ 

20 obtained from pGMl were treated with 10 units T^.DNA ligase in the 
presence of 200 pico moles of the 5*-phosphorylated synthetic 
oligonucleotide pCATGAATTCATG (_3) and in 20gl T^ ONA ligase buffer 
(20m.M tris, pH 7.6, 0.5 mM ATP, 10 mM MgCl^, 5 mM di thiothreitol ) at 
4^0 overnight. The solution was then heated 10 minutes at 70*'C to halt 

25 ligation. The linkers were cleaved by EcoRI digestion and the 
fragments, now with EcoRI ends were separated using 5 percent 
polyacryl ami de gel electrophoresis (herein after "PAGE") and the three 
largest fragments isolated from the gel by first staining with ethidium 
bromide, locating the fragments with ultraviolet light, and cutting from 

30 the gel the portions of interest. Each gel fragment, with-300 

microliters O.lxTBE, was placed in a dialysis bag and subjected to 




-16- 0036776 

bag, phenol extracted, chloroform extracted and made 0.2 M sodium
chloride, and the DNA recovered in water after ethanol
precipitation. [All DNA fragment isolations hereinafter described
are performed using PAGE followed by the electroelution method just
discussed.] The trp promoter-operator-containing gene with EcoRI
sticky ends 5 was identified in the procedure next described, which
entails the insertion of fragments into a tetracycline sensitive
plasmid 5 which, upon promoter-operator insertion, becomes
tetracycline resistant.
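
The linker described in part A is said to be self-complementary and to supply an EcoRI cleavage site. A minimal sketch of that check follows; it assumes only the standard EcoRI recognition sequence GAATTC, and the helper name `reverse_complement` is illustrative rather than anything taken from the specification.

```python
# Watson-Crick complements for the four DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

LINKER = "CATGAATTCATG"   # linker from the text, without the 5' phosphate "p"
ECORI_SITE = "GAATTC"     # EcoRI recognition sequence (assumed standard site)

# A self-complementary oligonucleotide equals its own reverse complement,
# so two copies can anneal to each other to form a blunt duplex linker.
is_self_complementary = reverse_complement(LINKER) == LINKER

# 0-based offset of the EcoRI site inside the linker.
site_offset = LINKER.find(ECORI_SITE)

print(is_self_complementary, site_offset)
```

Because the oligonucleotide pairs with itself, a single synthesis is enough to produce the double-stranded linker that is ligated onto the blunt PvuII ends.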

B. Creation of the plasmid pBRHtrp expressing tetracycline
resistance under the control of the trp promoter-operator and
identification and amplification of the trp promoter-operator
containing DNA fragment 5 isolated in (A.) above.



Plasmid pBRH1 (5) (R.I. Rodriguez et al., Nucleic Acids
Research 6, 3257-3287 [1979]) expresses ampicillin resistance and
contains the gene for tetracycline resistance but, there being no
associated promoter, does not express that resistance. The plasmid
is accordingly tetracycline sensitive. By introducing a
promoter-operator system in the EcoRI site, the plasmid can be made
tetracycline resistant.

pBRH1 was digested with EcoRI and the enzyme removed by phenol
extraction followed by chloroform extraction, and the DNA was recovered
in water after ethanol precipitation. The resulting DNA molecule 7 was,
in separate reaction mixtures, combined with each of the three DNA
fragments obtained in part A. above and ligated with T4 DNA ligase
as previously described. The DNA present in the reaction mixture was
used to transform competent E. coli K-12 strain 294 (K. Backman et
al., Proc Nat'l Acad Sci USA 73, 4174-4198 [1976]) (ATCC no. 31448)
by standard techniques (V. Hershfield et al., Proc Nat'l Acad Sci USA
71, 3455-3459 [1974]) and the bacteria plated on LB plates containing



enzyme analysis. The resulting plasmid 8, designated pBRHtrp,
expresses β-lactamase, imparting ampicillin resistance, and it
contains a DNA fragment including the trp promoter-operator and
encoding a first protein comprising a fusion of the first six amino
acids of the trp leader and approximately the last third of the trp E
polypeptide (this polypeptide is designated LE'), a second
protein corresponding to approximately the first half of the trp D
polypeptide (this polypeptide is designated D'), and a third protein
coded for by the tetracycline resistance gene.

C. Cloning genes for various end-product polypeptides and expression
of these as fusion proteins comprising end-product and specifically
cleavable trp D polypeptide precursor (Figure 2).

A DNA fragment comprising the trp promoter-operator and codons
for the LE' and D' polypeptides was obtained from plasmid pBRHtrp and
inserted into plasmids containing structural genes for various
desired polypeptides, next exemplified for the case of somatostatin
(Figure 2).

pBRHtrp was digested with EcoRI restriction enzyme and the
resulting fragment 5 isolated by PAGE and electroelution.
EcoRI-digested plasmid pSom 11 (K. Itakura et al., Science 198, 1056
(1977); G.B. patent publication no. 2 007 675 A) was combined with
fragment 5. The mixture was ligated with DNA ligase as
previously described and the resulting DNA transformed into E. coli
K-12 strain 294 as previously described. Transformant bacteria were
selected on ampicillin-containing plates. Resulting
ampicillin-resistant colonies were screened by colony hybridization
(M. Gruenstein et al., Proc Nat'l Acad Sci USA 72, 3951-3965 [1975])
using as a probe the trp promoter-operator-containing fragment 5
isolated from pBRHtrp, which had been radioactively labelled with
32P. Several colonies shown positive by colony hybridization were



containing the plasmid designated pSom7 Δ2 [...]

promoter-operator fragment in the desired orientation was grown in LB
medium containing 10 µg/ml ampicillin. The cells were grown to
optical density 1 (at 550 nm), collected by centrifugation and
resuspended in M9 media in tenfold dilution. Cells were grown for
2-3 hours, again to optical density 1, then lysed and total cellular
protein analyzed by SDS (sodium dodecyl sulfate) urea (15 percent)
polyacrylamide gel electrophoresis (J.V. Maizel Jr. et al., Meth
Virol 5, 180-246 [1971]).
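
The tenfold dilution into M9 followed by regrowth to the same optical density in 2-3 hours implies a rough doubling time. The arithmetic below is only a sketch: it assumes purely exponential growth with no lag phase and an optical density proportional to cell number, neither of which is stated in the text.

```python
import math

DILUTION_FACTOR = 10          # tenfold dilution into M9, from the text
REGROWTH_HOURS = (2.0, 3.0)   # "2-3 hours" to return to optical density 1

# Doublings needed to recover the starting density after the dilution.
doublings = math.log2(DILUTION_FACTOR)

# Implied doubling time, in minutes, at each end of the stated range.
doubling_minutes = [hours * 60 / doublings for hours in REGROWTH_HOURS]

print(round(doublings, 2), [round(m) for m in doubling_minutes])
```

Under those assumptions the culture passes through a little over three generations in tryptophan-poor medium before lysis.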

Figure 3 illustrates a protein gel analysis in which total
protein from various cultures is separated by size. The density of
individual bands reflects the quantity in which the respective
proteins are present. With reference to Figure 3, lanes 1 and 7 are
controls and comprise a variety of proteins of previously determined
size which serve as points of comparative reference. Lanes 2 and 3
segregate cellular protein from colonies of E. coli 294 transformed
with plasmid pSom7 Δ2 and respectively grown in LB (lane 2) and M9
(lane 3) media. Lanes 4 and 5 segregate cellular protein obtained
from similar cells transformed with the plasmid pThα7 Δ2, a thymosin
expression plasmid obtained by procedures essentially identical to
those already described, beginning with the plasmid pThα1 (see the
commonly assigned US patent application of Roberto Crea and Ronald B.
Wetzel, filed February 28, 1980 for Thymosin Alpha 1 Production, the
disclosure of which is incorporated herein by reference). Lane 4
segregates cellular protein from E. coli 294/pThα7 Δ2 grown in LB
media, whereas lane 5 segregates cell protein from the same
transformant grown in M9 media. Lane 6, another control, is the
protein pattern of E. coli 294/pBR322 grown in LB.

Comparison to controls shows the uppermost of the two most
prominent bands in each of lanes 3 and 5 to be proteins of size
anticipated in the case of expression of a fusion protein comprising
the D' polypeptide and, respectively, somatostatin and thymosin (the
other prominent band represents the LE' polypeptide resulting from

[...] conditions.

D. Cyanogen bromide cleavage and radioimmunoassay for hormone product

For both the thymosin and somatostatin cases, total cellular
protein was cyanogen bromide-cleaved, the cleavage product recovered
and, after drying, resuspended in buffer and analyzed by
radioimmunoassay, confirming the expression of product immunologically
identical, respectively, to somatostatin and thymosin. Cyanogen
bromide cleavage was as described in D.V. Goeddel et al., Proc Nat'l
Acad Sci USA 76, 106-110 [1979].

II. Construction of plasmids for direct expression of heterologous
genes under control of the trp promoter-operator system

The strategy for direct expression entailed creation of a
plasmid containing a unique restriction site distal from all control
elements of the trp operon into which heterologous genes could be
cloned in lieu of the trp leader sequence and in proper, spaced
relation to the trp leader polypeptide's ribosome binding site. The
direct expression approach is next exemplified for the case of human
growth hormone expression.

The plasmid pSom7 Δ2, 10 µg, was cleaved with EcoRI and the DNA
fragment 5 containing the tryptophan genetic elements was isolated by
PAGE and electroelution. This fragment, 2 µg, was digested with the
restriction endonuclease Taq I, 2 units, 10 minutes at 37°C such
that, on the average, only one of the approximately five Taq I sites
in each molecule is cleaved. This partially digested mixture of
fragments was separated by PAGE and an approximately 300 base pair
fragment 12 (Fig. 4) that contained one EcoRI end and one Taq I end
was isolated by electroelution. The corresponding Taq I site is
located between the transcription start and translation start sites
and is 5 nucleotides upstream from the ATG codon of the trp leader

transcription initiation signal, and trp leader ribosome binding
site.
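
The partial Taq I digestion is tuned so that, on average, one of about five sites per molecule is cut. As a rough illustration of what such a digest contains, the sketch below treats each site as cut independently with the same probability. That independence is an assumption made here for the arithmetic, not a statement from the text, and real sites may differ in cleavage rate.

```python
from math import comb

N_SITES = 5                   # "approximately five Taq I sites" per fragment
MEAN_CUTS = 1.0               # digestion tuned to about one cut per molecule
p_cut = MEAN_CUTS / N_SITES   # assumed per-site cleavage probability

def prob_cuts(k: int) -> float:
    """Binomial probability that exactly k of the N_SITES sites are cut."""
    return comb(N_SITES, k) * p_cut**k * (1 - p_cut) ** (N_SITES - k)

# Fraction of molecules carrying 0, 1, 2, ... 5 cuts.
distribution = [prob_cuts(k) for k in range(N_SITES + 1)]

print([round(p, 3) for p in distribution])
```

Under this simple model roughly two fifths of the molecules are cut exactly once, which is why a gel separation step is still needed to pick out the single fragment of interest.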








The Taq I residue at the 3' end of the resulting fragment
adjacent the translation start signal for the trp leader sequence was
next converted into an XbaI site, as shown in Figure 5. This was
done by ligating the fragment 12 obtained above to a plasmid
containing a unique (i.e., only one) EcoRI site and a unique XbaI
site. For this purpose, one may employ essentially any plasmid
containing, in order, a replicon, a selectable marker such as
antibiotic resistance, and EcoRI, XbaI and BamHI sites. Thus, for
example, an XbaI site can be introduced between the EcoRI and BamHI
sites of pBR322 (F. Bolivar et al., Gene 2, 95-119 [1977]) by, e.g.,
cleaving at the plasmid's unique Hind III site with Hind III followed
by single strand-specific nuclease digestion of the resulting sticky
ends, and blunt end ligation of a self-annealing double-stranded
synthetic nucleotide containing the recognition site such as
CCTCTAGAGG. Alternatively, naturally derived DNA fragments may be
employed, as was done in the present case, that contain a single XbaI
site between EcoRI and BamHI cleavage residues. Thus, an EcoRI and
BamHI digestion product of the viral genome of hepatitis B was
obtained by conventional means and cloned into the EcoRI and BamHI
sites of plasmid pGH6 (D.V. Goeddel et al., Nature 281, 544 [1979])
to form the plasmid pHS32. Plasmid pHS32 was cleaved with XbaI,
phenol extracted, chloroform extracted and ethanol precipitated. It
was then treated with 1 µl E. coli polymerase I, Klenow fragment
(Boehringer-Mannheim) in 30 µl polymerase buffer (50 mM potassium
phosphate pH 7.4, 7 mM MgCl2, 1 mM β-mercaptoethanol) containing
0.1 mM dTTP and 0.1 mM dCTP for 30 minutes at 0°C then 2 hr. at 37°C.
This treatment causes 2 of the 4 nucleotides complementary to the 5'
protruding end of the XbaI cleavage site to be filled in:



     5' CTAGA---            5' CTAGA---
                    -->
     3'     T---            3'   TCT---



[...] an end with two [...]

ethanol precipitation) was cleaved with EcoRI. The large plasmid
fragment 13 was separated from the smaller EcoRI-XbaI fragment by
PAGE and isolated after electroelution. This DNA fragment from pHS32
(0.2 µg) was ligated, under conditions similar to those described
above, to the EcoRI-Taq I fragment of the tryptophan operon (~0.01
µg), as shown in Figure 5. In this process the Taq I protruding end
is ligated to the XbaI remaining protruding end even though it is not
completely Watson-Crick base-paired:

   ---T        CTAGA---            ---TCTAGA---
          +                -->
   ---AGC        TCT---            ---AGCTCT---

A portion of this ligation reaction mixture was transformed into E.
coli 294 cells as in part I. above, heat treated and plated on LB
plates containing ampicillin. Twenty-four colonies were selected,
grown in 3 ml LB media, and plasmid isolated. Six of these were
found to have the XbaI site regenerated via E. coli catalyzed DNA
repair and replication:

   ---TCTAGA---            ---TCTAGA---
                   -->
   ---AGCTCT---            ---AGATCT---



These plasmids were also found to cleave both with EcoRI and HpaI and
to give the expected restriction fragments. One plasmid 14,
designated pTrp 14, was used for expression of heterologous
polypeptides, as next discussed.
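
The partial fill-in and the imperfect ligation described above can be traced with a small model. This is a sketch only: it represents a 5' overhang as a string and assumes the polymerase copies it base by base from the duplex junction outward, stopping at the first base whose complementary nucleotide was not supplied. The function names are illustrative.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def fill_in(overhang: str, dntps: set) -> str:
    """Return the 5' overhang left after polymerase fill-in.

    `overhang` is the single-stranded 5' extension written 5'->3'.
    Synthesis copies it starting at the base next to the duplex (its 3' end)
    and stops at the first template base whose complement is not supplied.
    """
    remaining = overhang
    while remaining and COMPLEMENT[remaining[-1]] in dntps:
        remaining = remaining[:-1]
    return remaining

def mismatches(top: str, bottom_3_to_5: str) -> int:
    """Count non-Watson-Crick positions between two aligned strands."""
    return sum(COMPLEMENT[t] != b for t, b in zip(top, bottom_3_to_5))

# XbaI leaves the 5' overhang CTAG; only dTTP and dCTP were supplied.
xba_after_partial_fill = fill_in("CTAG", {"T", "C"})
xba_after_full_fill = fill_in("CTAG", {"A", "C", "G", "T"})

# Junction strands as drawn in the text (top 5'->3', bottom 3'->5').
before_repair = mismatches("TCTAGA", "AGCTCT")
after_repair = mismatches("TCTAGA", "AGATCT")

print(repr(xba_after_partial_fill), repr(xba_after_full_fill),
      before_repair, after_repair)
```

The two-base overhang that survives the partial fill-in is what pairs, with a single mismatch, against the two-base Taq I overhang; the repaired junction reads as a perfect XbaI site on both strands.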

The plasmid pHGH 107 (18 in Figure 6; D.V. Goeddel et al., Nature
281, 544, 1979) contains a gene for human growth hormone made up of
23 amino acid codons produced from synthetic DNA fragments and 168
amino acid codons obtained from complementary DNA produced via
reverse transcription of human growth hormone messenger RNA. This
gene 21, though it lacks the codons of the "pre" sequence of human
growth hormone, does contain an ATG translation initiation codon.



ethanol precipitation the plasmid was treated with BamHI. See Figure
6.

The human growth hormone ("HGH") gene-containing fragment 21 was
isolated by PAGE followed by electroelution. The resulting DNA
fragment also contains the first 350 nucleotides of the tetracycline
resistance structural gene, but lacks the tetracycline
promoter-operator system so that, when subsequently cloned into an
expression plasmid, plasmids containing the insert can be located by
the restoration of tetracycline resistance. Because the EcoRI end of
the fragment 21 has been filled in by the Klenow polymerase I
procedure, the fragment has one blunt and one sticky end, ensuring
proper orientation when later inserted into an expression plasmid.
See Figure 6.

The expression plasmid pTrp14 was next prepared to receive the
HGH gene-containing fragment prepared above. Thus, pTrp14 was XbaI
digested and the resulting sticky ends filled in with the Klenow
polymerase I procedure employing dATP, dTTP, dGTP and dCTP. After
phenol and chloroform extraction and ethanol precipitation the
resulting DNA was treated with BamHI and the resulting large
plasmid fragment 17 isolated by PAGE and electroelution. The
pTrp14-derived fragment 17 had one blunt and one sticky end,
permitting recombination in proper orientation with the HGH
gene-containing fragment 21 previously described.

The HGH gene fragment 21 and the pTrp14 ΔXba-BamHI fragment 17
were combined and ligated together under conditions similar to those
described above. The filled in XbaI and EcoRI ends ligated together
by blunt end ligation to recreate both the XbaI and the EcoRI site:

 XbaI filled in   EcoRI filled in                HGH gene initiation
                                                           |
   -TCTAG     +     AATTCTATG-     -->     -TCTAGAATTCTATG-
   -AGATC           TTAAGATAC-             -AGATCTTAAGATAC-

This construction also recreates the tetracycline resistance gene.
Since the plasmid pHGH 107 expresses tetracycline resistance from a
promoter lying upstream from the HGH gene (the lac promoter), this
construction 22, designated pHGH 207, permits expression of the gene
for tetracycline resistance under the control of the tryptophan
promoter-operator. Thus the ligation mixture was transformed into E.
coli 294 and colonies selected on LB plates containing 5 µg/ml
tetracycline.
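
The claim that blunt-end ligation of the two filled-in ends recreates both restriction sites can be checked by simple string concatenation. The sketch below uses the top-strand sequences shown in the diagram and assumes the standard recognition sequences TCTAGA (XbaI) and GAATTC (EcoRI); the variable names are illustrative.

```python
XBAI_SITE = "TCTAGA"    # XbaI recognition sequence
ECORI_SITE = "GAATTC"   # EcoRI recognition sequence

# Top strands (5'->3') of the two filled-in blunt ends, as drawn in the text.
vector_end = "TCTAG"       # pTrp14 end after XbaI digestion and full fill-in
insert_end = "AATTCTATG"   # HGH fragment end after EcoRI fill-in, through ATG

# Blunt-end ligation joins the two strands directly, with no bases lost.
junction = vector_end + insert_end

xbai_at = junction.find(XBAI_SITE)    # 0-based offset of the XbaI site
ecori_at = junction.find(ECORI_SITE)  # 0-based offset of the EcoRI site
atg_at = junction.find("ATG")         # offset of the HGH initiation codon

print(junction, xbai_at, ecori_at, atg_at)
```

The two sites overlap by the shared "GA" dinucleotide, and the initiation codon sits a few bases downstream of the regenerated EcoRI site.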

In order to confirm the direct expression of human growth
hormone from plasmid pHGH 207, total cellular protein derived from
E. coli 294/pHGH 207 that had been grown to optical density 1 in LB
media containing 10 µg/ml ampicillin and diluted 1 to 10 into M9
media, and grown again to optical density 1, was subjected to SDS gel
electrophoresis as in the case of part I above and compared to
similar electrophoresis data obtained for human growth hormone as
previously expressed by others (D.V. Goeddel et al., Nature 281, 544
(1979)). Figure 7 is a photograph of the resulting stained gel
wherein: Lanes 1 and 7 contain protein markers of various known
sizes; Lane 2 is a control that separates total cellular protein of
E. coli strain 294 pBR322; Lane 3 segregates protein from E. coli
294/pHGH 107 grown in LB media; Lane 4 segregates protein from E.
coli 294/pHGH 107 grown in M9 media; Lane 5 segregates protein from
E. coli 294/pHGH 207 grown in LB media; and Lane 6 segregates protein
from E. coli 294/pHGH 207 grown in M9. The dense band in Lane 6 is
human growth hormone, as shown by comparison to the similar bands in
Lanes 2-4. As predicted by the invention, the organism E. coli
294/pHGH 207 when grown in tryptophan-rich LB media produces less
human growth hormone by reason of tryptophan repressor/operator
interactions, and when grown in M9 media produces considerably more
HGH than E. coli 294/pHGH 107 owing to the induction of the stronger
tryptophan promoter-operator system vs. the lac promoter-operator
system in pHGH 107.



[...] promoter-operator.

The plasmid pHGH 207 created in the preceding section was next
used to obtain a DNA fragment containing the control elements of the
tryptophan operon (with the attenuator deleted) and to create a
plasmid "expression vector" suitable for the direct expression of
various structural gene inserts. The strategy for creation of the
general expression plasmid involved removal of the tryptophan control
region from pHGH 207 by EcoRI digestion and insertion in the
EcoRI-digested plasmid pBRH1 used in part I, supra. pBRH1, as
previously noted, is an ampicillin resistant plasmid containing the
tetracycline resistance gene but is tetracycline sensitive because of
the absence of a suitable promoter-operator system. The resulting
plasmid, pHKY 1, whose construction is more particularly described
below and shown in Figure 8, is both ampicillin and tetracycline
resistant, contains the tryptophan promoter-operator system, lacks
the tryptophan attenuator, and contains a unique XbaI site distal
from the tryptophan promoter-operator. The tryptophan
promoter-operator and unique XbaI site are bounded by EcoRI sites,
such that the promoter-operator-XbaI-containing fragment can be
removed for insertion in other structural gene-containing plasmids.
Alternatively, heterologous structural genes may be inserted, either
into the XbaI site or (after partial EcoRI digestion) into the EcoRI
site distal from the tryptophan control region, in either case so as
to come under the control of the tryptophan promoter-operator system.

Plasmid pHGH 207 was EcoRI digested and the trp promoter
containing EcoRI fragment 22 recovered by PAGE followed by
electroelution.



Plasmid pBRH1 was EcoRI digested and the cleaved ends treated
with bacterial alkaline phosphatase ("BAP") (1 µg, in 50 mM tris pH 8
and 10 mM MgCl2 for 30 min. at 65°C) to remove the phosphate groups
on the protruding EcoRI ends. Excess bacterial alkaline phosphatase
was removed by phenol extraction, chloroform extraction and ethanol
precipitation. The resulting linear DNA 7, because it lacks

[...]

will not itself recircularize, permitting more facile screening for
plasmids containing the inserts. The EcoRI fragment derived from
pHGH 207 and the linear DNA obtained from pBRH1 were combined in the
presence of ligase as previously described and ligated. A
portion of the resulting mixture was transformed into E. coli strain
294 as previously described, plated on LB media containing 5 µg/ml of
tetracycline, and 12 tetracycline resistant colonies selected.
Plasmid was isolated from each colony and examined for the presence
of a DNA insert by restriction endonuclease analysis employing EcoRI
and XbaI. One plasmid containing the insert was designated pHKY1.

IV. Creation of a plasmid containing the tryptophan operon capable
of expressing a specifically cleavable fusion protein comprising 6
amino acids of the trp leader peptide and the last third of the trp E
polypeptide (designated LE') and a heterologous structural gene
product.

The strategy for the creation of a LE' fusion protein expression
plasmid entailed the following steps:

a. Provision of a gene fragment comprising codons for the
distal region of the LE' polypeptide having Bgl II and EcoRI
sticky ends respectively at the 5' and at the 3' ends of the
coding strand;

b. Elimination of the codons from the distal region of the LE'
gene fragment and those for the trp D gene from plasmid pSom7 Δ2
and insertion of the fragment formed in step a, reconstituting
the LE' codon sequence immediately upstream from
that for the heterologous gene for somatostatin.

1. With reference to Figure 9(a), plasmid pSom7 Δ2 was Hind III
digested followed by digestion with lambda exonuclease (a 5' to

[...] Hind III-digested pSom7 Δ2 was dissolved in [...]

pH 9.6, 1 mM MgCl2, 1 mM β-mercaptoethanol). The resulting mixture
was treated with 5 units of lambda exonuclease for 50 minutes at room
temperature. The reaction mixture obtained was then phenol
extracted, chloroform extracted and ethanol precipitated.

In order ultimately to create an EcoRI residue at the distal end
of the LE' gene fragment a primer 32pCCTGTGCATGAT was synthesized
by the improved phosphotriester method (R. Crea et al., Proc Nat'l
Acad Sci USA 75, 5755 [1978]) and hybridized to the single stranded
end of the LE' gene fragment resulting from lambda exonuclease
digestion. The hybridization was performed as next described.

20 µg of the lambda exonuclease-treated Hind III digestion
product of plasmid pSom7 Δ2 was dissolved in 20 µl H2O and combined
with 6 µl of a solution containing approximately 80 picomoles of the
5'-phosphorylated oligonucleotide described above. The synthetic
fragment was hybridized to the 3' end of the LE' coding sequence and
the remaining single strand portion of the LE' fragment was filled in
by the Klenow polymerase I procedure described above, using dATP,
dTTP, dGTP and dCTP.

The reaction mixture was heated to 50°C and let cool slowly to
10°C, whereafter 4 µl of Klenow enzyme were added. After 15 minute
room temperature incubation, followed by 30 minutes incubation at
37°C, the reaction was stopped by the addition of 5 µl of 0.25 molar
EDTA. The reaction mixture was phenol extracted, chloroform
extracted and ethanol precipitated. The DNA was subsequently cleaved
with the restriction enzyme Bgl II. The fragments were separated by
PAGE. An autoradiogram obtained from the gel revealed a
32P-labelled fragment of the expected length of approximately 470
bp, which was recovered by electroelution. As outlined, this
fragment LE'(d) has a Bgl II end and a blunt end coinciding with the
beginning of the primer.



in Figure 9, the thymosin gene contains a Bgl II site as well.
Plasmid pThα1 also contains a gene specifying ampicillin resistance.
In order to create a plasmid capable of accepting the LE'(d) fragment
prepared above, pThα1 was EcoRI digested followed by Klenow
polymerase I reaction with dTTP and dATP to blunt the EcoRI
residues. Bgl II digestion of the resulting product created a linear
DNA fragment 33 containing the gene for ampicillin resistance and, at
its opposite ends, a sticky Bgl II residue and a blunt end. The
resulting product could be recircularized by reaction with the LE'(d)
fragment containing a Bgl II sticky end and a blunt end in the
presence of ligase to form the plasmid pTrp24 (Fig. 9b). In
doing so, an EcoRI site is recreated at the position where blunt end
ligation occurred.
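
Why an EcoRI site reappears at the blunt junction can be seen from the primer sequence itself. The sketch below rests on one assumption drawn from the description: since the primer anneals to the coding strand, the coding strand of LE'(d) ends with the primer's reverse complement at the blunt end. The filled-in EcoRI end is taken as AATTC on the same strand.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

PRIMER = "CCTGTGCATGAT"   # synthetic primer from the text, 5'->3'
ECORI_SITE = "GAATTC"     # EcoRI recognition sequence

# Assumption: the coding strand of LE'(d) ends, at the blunt end, with the
# reverse complement of the primer that was annealed to it.
coding_strand_end = reverse_complement(PRIMER)

# Coding-strand start of the EcoRI end after fill-in with dATP and dTTP.
filled_ecori_end = "AATTC"

# Blunt-end ligation places the two sequences directly next to each other.
junction = coding_strand_end + filled_ecori_end

print(coding_strand_end, junction, ECORI_SITE in junction)
```

The final G contributed by the primer-defined end supplies the base that the filled-in AATTC end lacks, which is the sense in which the choice of primer sequence determines whether the site is recreated.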

With reference to Figure 10, successive digestion of pTrp24 with
Bgl II and EcoRI, followed by PAGE and electroelution, yields a
fragment having codons for the LE'(d) polypeptide with a Bgl II
sticky end and an EcoRI sticky end adjacent its 3' coding terminus.
The LE'(d) fragment 38 can be cloned into the Bgl II site of plasmid
pSom7 Δ2 to form an LE' polypeptide/somatostatin fusion protein
expressed under the control of the tryptophan promoter-operator, as
shown in Figure 10. To do so requires (1) partial EcoRI digestion of
pSom7 Δ2 in order to cleave the EcoRI site distal to the tryptophan
promoter-operator, as shown in Figure 10, and (2) proper choice of the
primer sequence (Figure 9) in order to properly maintain the codon
reading frame, and to recreate an EcoRI cleavage site.

Thus, 15 µg plasmid pSom7 Δ2 was diluted into 200 µl of buffer
containing 20 mM Tris, pH 7.5, 5 mM MgCl2, 0.02 NP40 detergent,
100 mM NaCl and treated with 0.5 units EcoRI. After 15 minutes at
37°C, the reaction mixture was phenol extracted, chloroform extracted
and ethanol precipitated and subsequently digested with Bgl II. The
larger resulting fragment 36 was isolated by the PAGE procedure
followed by electroelution. This fragment contains the codons
"LE'(p)" for



upon transformation into E. coli strain 294, as previously described,
efficiently produced a fusion protein consisting of the fully
reconstituted LE' polypeptide and somatostatin under the control of
the tryptophan promoter-operator. The fusion protein, from which the
somatostatin may be specifically cleaved owing to the presence of a
methionine at the 5' end of the somatostatin sequence, was segregated
by SDS polyacrylamide gel electrophoresis as previously described.
The fusion protein product is the most distinct band apparent in Lane
6 of Figure 11, discussed in greater detail in Part VI, infra.
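
The specific release of somatostatin depends on cyanogen bromide cutting on the carboxyl side of methionine. A minimal sketch of that logic follows. The somatostatin sequence is the known 14-residue hormone, but the `LEADER` string is a hypothetical stand-in used only for illustration and is not the LE' sequence.

```python
def cnbr_cleave(protein: str) -> list:
    """Split a one-letter protein sequence after every methionine (M).

    Cyanogen bromide cleaves on the carboxyl side of methionine; the residue
    is chemically converted in the process but is kept here as 'M'.
    """
    fragments, current = [], ""
    for residue in protein:
        current += residue
        if residue == "M":
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments

SOMATOSTATIN = "AGCKNFFWKTFTSC"   # 14-residue hormone, contains no methionine
LEADER = "MKAIFVLKGSLDRDPEF"      # hypothetical stand-in, NOT the real LE'

# A methionine placed immediately before the hormone marks the cleavage point.
fusion = LEADER + "M" + SOMATOSTATIN
fragments = cnbr_cleave(fusion)

print(fragments)
```

Because the hormone itself contains no methionine, it emerges as a single intact fragment while the precursor portion is cut wherever it carries the residue.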

V. Creation of an expression system for trp LE' polypeptide fusions
wherein tetracycline resistance is placed under the control of the
tryptophan promoter-operator.

The strategy for creation of an expression vehicle capable of
receiving a wide variety of heterologous polypeptide genes for
expression as trp LE' fusion proteins under the control of the
tryptophan operon entailed construction of a plasmid having the
following characteristics:

1. Tetracycline resistance which would be lost in the event
the promoter-operator system controlling the genes specifying
such resistance was excised.

2. Removing the promoter-operator system that controls
tetracycline resistance, and recircularizing by ligation to a
heterologous gene and a tryptophan promoter-operator system in
proper reading phase with reference thereto, thus restoring
tetracycline resistance and accordingly permitting
identification of plasmids containing the heterologous gene
insert.

In short, and consistent with the nature of the intended inserts, the 



control of a promoter-operator system.

Thus, with reference to Figure 12, plasmid pBR322 was Hind III
digested and the protruding Hind III ends in turn digested with S1
nuclease. The S1 nuclease digestion involved treatment of 10 µg of
Hind III-cleaved pBR322 in 30 µl S1 buffer (0.3 M NaCl, 1 mM ZnCl2,
25 mM sodium acetate, pH 4.5) with 300 units S1 nuclease for 30
minutes at 15°C. The reaction was stopped by the addition of 1 µl of
30 X S1 nuclease stop solution (0.8 M tris base, 50 mM EDTA). The
mixture was phenol extracted, chloroform extracted and ethanol
precipitated, then EcoRI digested as previously described and the
large fragment 45 obtained by PAGE procedure followed by
electroelution. The fragment obtained has a first EcoRI sticky end
and a second, blunt end whose coding strand begins with the
nucleotide thymidine. As will be subsequently shown, the S1-digested
Hind III residue beginning with thymidine can be joined to a Klenow
polymerase I-treated Bgl II residue so as to reconstitute the Bgl II
restriction site upon ligation.

Plasmid pSom7 Δ2, as prepared in Part I above, was Bgl II
digested and the Bgl II sticky ends resulting made double stranded
with the Klenow polymerase I procedure using all four deoxynucleotide
triphosphates. EcoRI cleavage of the resulting product followed by
PAGE and electroelution of the small fragment 42 yielded a linear
piece of DNA containing the tryptophan promoter-operator and codons
of the LE' "proximal" sequence upstream from the Bgl II site
("LE'(p)"). The product had an EcoRI end and a blunt end resulting
from filling in the Bgl II site. However, the Bgl II site is
reconstituted by ligation of the blunt end of fragment 42 to the
blunt end of fragment 45. Thus, the two fragments were ligated in
the presence of T4 DNA ligase to form the recircularized plasmid
pHKY 10 (see Figure 12) which was propagated by transformation into
competent E. coli strain 294 cells. Tetracycline resistant cells
bearing the recombinant plasmid pHKY 10 were grown up, plasmid DNA



[...] This

DNA fragment 49 contains the origin of replication and
subsequently proved useful as a first component in the construction
of plasmids where both the genes coding for trp LE' polypeptide
fusion proteins and the tet resistance gene are controlled by the trp
promoter/operator.

Plasmid pSom7 Δ2Δ4, as previously prepared in Part IV, could be
manipulated to provide a second component for a system capable of
receiving a wide variety of heterologous structural genes. With
reference to Figure 13, the plasmid was subjected to partial EcoRI
digestion (see Part IV) followed by Pst I digestion and fragment 51
containing the trp promoter/operator was isolated by the PAGE
procedure followed by electroelution. Partial EcoRI digestion was
necessary to obtain a fragment which was cleaved adjacent to the 5'
end of the somatostatin gene but not cleaved at the EcoRI site
present between the ampicillin resistance gene and the trp promoter
operator. Ampicillin resistance lost by the Pst I cut in the apR
gene could be restored upon ligation with fragment 51.

In a first demonstration the third component, a structural gene
for thymosin alpha-one, was obtained by EcoRI and BamHI digestion of
plasmid pThα1. The fragment, 52, was purified by PAGE and
electroelution.

The three gene fragments 49, 51 and 52 could now be ligated
together in proper orientation, as depicted in Figure 13, to form the
plasmid pThα7Δ1Δ4, which could be selected by reason of the
restoration of ampicillin and tetracycline resistance. The plasmid,
when transformed into E. coli strain 294 and grown up under
conditions like those described in Part I, expressed a trp LE'
polypeptide fusion protein from which thymosin alpha one could be
specifically cleaved by cyanogen bromide treatment. When other
heterologous structural genes having EcoRI and BamHI termini were
similarly ligated with the pHKY10-derived and pSom7 Δ2Δ4-derived
components, trp LE' polypeptide fusion proteins containing the



gel electrophoresis separation of total cellular protein from E. coli
strain 294 transformants, the darkest band in each case representing
the fusion protein product produced under control of the tryptophan
promoter-operator system. In Figure 11, Lane 1 is a control which
segregates total cellular protein from E. coli 294/pBR322. Lane 2
contains the somatostatin fusion product from plasmid pSom7 Δ2Δ4
prepared in Part IV. Lane 3 is the somatostatin-containing
expression product of pSom7 Δ1Δ4. Lane 4 contains the expression
product of pThα7Δ1Δ4, whereas Lane 5 contains the product expressed
from a plasmid obtained when the pHKY-10-derived and pSom7
Δ2Δ4-derived fragments discussed above were ligated with an
EcoRI/BamHI terminated structural gene encoding human proinsulin and
prepared in part by certain of us. Lanes 6 and 7 respectively
contain, as the darkest band, a trp LE' polypeptide fusion protein
from which can be cleaved the B and A chain of human insulin. The
insulin B and A structural genes were obtained by EcoRI and BamHI
digestion of plasmids pIB1 and pIA11 respectively, whose construction
is disclosed in D.V. Goeddel et al., Proc Nat'l Acad Sci USA 76, 106
[1979]. Lane 8 contains size markers, as before.



* * * 



While the invention in its most preferred embodiment is
described with reference to E. coli, other enterobacteriaceae could
likewise serve as host cells for expression and as sources for trp
operons, among which may be mentioned as examples Salmonella
typhimurium and Serratia marcescens. Thus, the invention is not to be
limited to the preferred embodiments described, but only by the
lawful scope of the appended claims.

CLAIMS:



1. A method of creating an expression plasmid for the
expression of a heterologous gene which comprises the
simultaneous ligation, in phase, of:

(a) a first linear double-stranded DNA fragment
containing a replicon and a gene which expresses a
selectable characteristic when placed under the
direction of a bacterial promoter, said fragment
lacking any such promoter;

(b) a second linear double-stranded DNA fragment 
comprising said heterologous gene; and 

(c) a third double-stranded DNA fragment which comprises 
a bacterial promoter; 

the ligatable ends of said fragments being configured such 
that upon ligation to form a replicable plasmid both the gene 
for the selectable characteristic and the heterologous gene 
come under the direction of the promoter, thus permitting use 
of the selectable characteristic in selection of transformant 
bacteria colonies capable of expressing the heterologous gene. 

2. The method of claim 1 wherein the selectable 
characteristic is antibiotic resistance. 

3. The method of claim 2 wherein the selectable 
characteristic is tetracycline resistance and wherein the 
bacterial promoter is the trp promoter. 

4. The method of claim 3 wherein ligation reconstitutes an 
operon for the expression of ampicillin resistance as well. 

5. A method of cleaving double stranded DNA at any given
point which comprises:

(b) hybridizing to the single-stranded region formed in
step (a) a complementary primer length of single- 
stranded DNA, the 5' end of the primer lying 
opposite the nucleotide adjoining the intended 
cleavage site; 

(c) restoring that portion of the second strand 
eliminated in step (a) which lies in the 3' direction
from said primer by reaction with DNA polymerase in 
the presence of adenine, thymine, guanine and 
cytosine-containing deoxynucleotide triphosphates; 
and 

(d) digesting the remaining single-stranded length of 
DNA which protrudes beyond the intended cleavage 
point. 



6. The method of claim 5 wherein steps (c) and (d) are
performed simultaneously by reaction with DNA polymerase which
polymerizes in the direction of 5' -> 3', is exonucleolytic in the
direction of 3' -> 5', but non-exonucleolytic in the direction of 5' -> 3'.

7. The method of claim 6 wherein the polymerase is Klenow
Polymerase I.

8. A plasmidic expression vehicle for the production in
E. coli bacteria of a heterologous polypeptide product, said
vehicle having a sequence of double-stranded DNA comprising,
in phase from a first 5' to a second 3' end of the coding
strand thereof, the elements:

(i) a bacterial trp promoter-operator system;
(ii) nucleotides coding for a ribosome binding site for
translation of element (iv);
(iii) nucleotides coding for a translation start signal
for translation of element (iv); and

said sequence comprising neither any trp attenuation
capability nor nucleotides coding for the trp E ribosome
binding site.

9. The method of producing a polypeptide product by the
expression in bacteria of a structural gene coding therefor
which comprises:

(a) providing a bacterial inoculant transformed with a 
replicable plasmidic expression vehicle having a 
sequence of double-stranded DNA comprising, in 
phase from a first 5' to a second 3' end of the
coding strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 
(ii) nucleotides coding for a ribosome binding 
site for translation of element (iv); 
(iii) nucleotides coding for a translation start 
signal for translation of element (iv); and 
(iv) a structural gene encoding the amino acid 
sequence of a heterologous polypeptide;
said sequence comprising neither any trp attenuation 
capability nor nucleotides coding for the trp E 
ribosome binding site; 

(b) placing the transformed inoculant in a fermentation
vessel and growing the same to a predetermined level 
in suitable nutrient media containing additive 
tryptophan sufficient in quantity to repress said 
promoter-operator system; and 

(c) depriving said bacteria of said additive so as to 
derepress said system and occasion the expression of 
the product for which said structural gene codes. 

10. The vehicle of claim 8 or method of claim 9 wherein the
polypeptide expressed by said structural gene is entirely

11. The vehicle of claim 8 or the method of claim 9 wherein
the polypeptide expressed is a fusion protein comprising a 
heterologous polypeptide and at least a portion of the amino 
acid sequence of a homologous polypeptide. 

12. The vehicle or method of claim 11 wherein said portion is 
a portion of the amino acid sequence of an enzyme involved in 
the biosynthetic pathway from chorismic acid to tryptophan. 

13. The vehicle or method of claim 12 wherein the heterologous 
polypeptide is a bioactive polypeptide and the fused homologous 
polypeptide is a specifically cleavable bioinactivating
polypeptide. 

14. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp E polypeptide and wherein said ribosome 
binding site is the ribosome binding site for the trp leader 
polypeptide. 

15. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp D polypeptide. 

16. The vehicle or method of claim 14 wherein the fusion 
protein comprises an heterologous polypeptide and a homologous 
polypeptide which itself constitutes a fusion of about the 
first six amino acids of the trp leader polypeptide and the
amino acid sequence encoded by at least about the distal 
third of the trp E polypeptide gene. 

17. The vehicle of claim 8 or method of claim 9 wherein the
heterologous polypeptide comprises a recoverable polypeptide
selected from the group consisting of human growth hormone,
human proinsulin, somatostatin, thymosin alpha 1, the A chain
of human insulin and the B chain of human insulin.

18. The method of claim 8 wherein tryptophan deprivation is
effected by cessation of addition of said additive and by 
dilution of the fermentation media in which said inoculant is 
first grown up. 

19. The method of claim 18 wherein the host bacteria is 
E. coli. 



20. The plasmids pBRHtrp, pSOM7Δ2, pHGH207, pHKY1, pSOM7Δ2Δ4,
pThα7Δ1Δ4, and pThα7Δ2.
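
The cleavage method of claims 5 to 7 can be pictured with a short string-based sketch. It is only an illustration under simplifying assumptions, not the procedure of the specification: the function names and the example sequences are hypothetical, the single-stranded template is assumed to be already available (the step that produces it is not modelled), and both the polymerase extension and the digestion of the protruding single strand are reduced to plain string operations.

```python
# Illustrative model only: strands are written 5' -> 3' as Python strings.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def primer_repair_cleave(template, primer):
    """Model steps (b) to (d) on a single-stranded template.

    (b) the primer anneals where its reverse complement occurs in the
        template, so the primer's 5' end marks the intended cleavage point;
    (c) the second strand is restored in the 3' direction from the primer;
    (d) the template tail protruding beyond the cleavage point is removed.
    Returns (trimmed_template, restored_strand), both 5' -> 3'.
    """
    site = revcomp(primer)
    idx = template.find(site)
    if idx < 0:
        raise ValueError("primer does not anneal to the template")
    end = idx + len(primer)  # template position opposite the primer's 5' end
    # (c) extension copies the template region lying 5' of the annealing site
    restored = primer + revcomp(template[:idx])
    # (d) the single-stranded 3' tail of the template is digested away
    trimmed = template[:end]
    return trimmed, restored
```

With the invented template `ACGTTGCATTAGGC` and primer `ATGCAA`, the function returns two fully complementary strands, that is, a blunt-ended duplex terminating exactly at the position set by the primer's 5' end.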